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Detecting communities in large networks has drawn much attention over the years. While mod- 
ularity remains one ol the more popular methods ol community detection, the so-called resolution 
limit remains a significant drawback. To overcome this issue, it was recently suggested that instead 
of comparing the network to a random null model, as is done in modularity, it should be compared 
to a constant factor. However, it is unclear what is meant exactly by "resolution-limit-free", that 
is, not suffering from the resolution limit. Furthermore, the question remains what other meth- 
ods could be classified as resolution-limit-free. In this paper we suggest a rigorous definition and 
derive some basic properties of resolution-limit-free methods. More importantly, we are able to 
prove exactly which class of community detection methods are resolution-limit-free. Furthermore, 
we analyze which methods are not resolution-limit-free, suggesting there is only a limited scope for 
resolution-limit-free community detection methods. Finally, we provide such a natural formulation, 
and show it performs superbly. 

PACS numbers: 89.75.Fb, 89.65.-s 



I. INTRODUCTION 

The last decade has seen an incredible rise in net- 
work studies, and will likely continue to rise [DU]. Be- 
sides the study of properties, such as degree distributions, 
clustering coefficients and average path length [3], many 
complex networks exhibit some modular structure [H [5] . 
These communities might represent different functions, 
or sociological communities, and have been successfully 
studied on a wide variety of networks, ranging from 
metabolic networks [5] to mobile phone networks [7] and 
airline transportation networks |S]. 

One of the most popular methods for community de- 
tection is that of modularity The past few years 
suggestions have been made to extend or alter the orig- 
inal definition, for example, allowing detection in bipar- 
tite networks |10j , networks with negative links |llj , and 
dynamical networks 12J. Although modularity optimiza- 
tion seems to be able to accurately identify known com- 
munity structures [13] . it suffers from an inherent prob- 
lem, namely a resolution limit |14j . which affects the ef- 
fectiveness of community detection [TS] . This resolution 
limit prevents detection of smaller communities in large 
networks, although this effect can be mitigated somewhat 
by a so-called resolution parameter [16j , which can be re- 
lated to time scales of random walks on the network [T7] . 
Another approach adds self-loops in order to circumvent 
this resolution limit problem 18J, and can similarly be re- 
lated to the random walk approach [17] . The use of such 
resolution parameters enables the investigation of com- 
munity structures at various levels of description. The 
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analysis of which levels of description are meaningful or 
relevant then becomes important, but we will not inves- 
tigate this issue here. 

Recently, a new method has been suggested that would 
not suffer from this resolution limit |19| . For showing a 
method suffers from a resolution limit a few clear cases 
suffice, but the opposite seems more difficult to argue. 
That is, although there is no problem for the cases an- 
alyzed, perhaps more complex cases will show some is- 
sues not yet considered. Hence, a proper definition of 
resolution-limit-free is called for, which we will develop 
in this paper. Furthermore, the question then is what 
methods will suffer from this resolution limit and which 
not. 

We will analyze this question within the framework of 
the first principle Potts model as developed by Reichardt 
and Bornholdt [20) . Various methods can be derived from 
this first principle Potts model, among them modular- 
ity, and we will briefly examine them. We will suggest a 
very simple alternative, which we term the constant Potts 
model (CPM). It can be easily shown that the CPM is 
resolution-limit-free according to our definition, but it 
will follow immediately from the more general theorem 
we will prove. Arguably, the CPM is the simplest formu- 
lation of any (non-trivial) resolution-limit-free method, 
and can be well interpreted. 

In the next section, we will briefly examine this first 
principle Potts model, review some models that can be 
derived from it, and introduce the CPM. We will then 
briefly explain the problem of the resolution limit when 
using modularity, followed by the introduction of the defi- 
nition of a resolution-limit-free method (i.e. not suffering 
from a resolution limit), and we will show some general 
properties of resolution-limit-free methods. We will then 
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prove which methods are resolution-limit-free and ana- 
lyze which are not. Finally, we show the CPM method 
performs superbly. 

II. POTTS MODEL FOR COMMUNITY 
DETECTION 

First, let us introduce the notation. We consider a 
connected graph G = (V,E) with n = \V\ nodes and 
to = \E\ edges. The adjacency matrix Ay = 1 if there 
is an (ij) edge, and otherwise. For weighted graphs 
the weight of a link is denoted by wy, while for an un- 
weighted graph we can consider wy — 1. We denote the 
community of a node i by Oi . 

In principle, links within communities should be rela- 
tively frequent, while those between communities should 
be relatively rare. Building on this idea, we will (i) re- 
ward links within communities; and (ii) penalize missing 
links within communities [2D]. In general, this can then 
be written as 

H = - ^(dijAij - bij(l - A ij ))S(a i ,a j ), (1) 

ij 

where 8(<Ji,(jj) = 1 if cr, = ctj and zero otherwise, and 
with some weights ay, fry > 0. Minimal H correspond 
to desirable partitions, although such a minimum is not 
necessarily unique. The choice of the weights ay and &y 
are important, and have a definite impact on what type 
of communities are detected. 



A. Previous methods 

In the current literature, at least four different choices 
exist (and presumably some other methods may be 
rewritten as such), leading to four different methods for 
detecting communities. We will briefly explicate these 
four different approaches. 

Reichardt and Bornholdt (RB) set ay = tuy — bij and 
bij = iRBPij with a new variable py that represents the 
probability of a link between i and j, known as the ran- 
dom null model. Working out their choice of parameters, 
we arrive at 

Urb = -J^AjjWij - jRBPij)8(<Ti,<Tj). (2) 

ij 

One of the most used null models is the so-called configu- 
ration model, which is Pij = kikj/2m, where fej = ■ Aji 
is the degree of node i. By using this null model, and set- 
ting jrb — 1 we recover the original definition of mod- 
ularity 9 . Independent of the exact choice of the null 
model , it can be shown the method will suffer from a 
resolution limit |16j . which thereby also holds for modu- 
larity [14] . 

Another approach by Arenas, Fernandes and Gomez 
(AFG) uses self-loops in order to try to circumvent the 



resolution limit [T5]. They do not explicitly derive their 
model based on the first principle Potts model, but it 
can easily be done. If we set ay = u)y — b^ and &y = 

Pij(r)-r5(i,j), wither) = (i)i ^ r + '' ) and S(i,j) = I 
only if i = j and zero otherwise, we arrive at their model 
(up to a multiplicative scaling) 

T~Lafg = ~ ^ (AjWjj + r<% - p ij (r))6(a i ,a j ) (3) 

ij 

The null model py (r) is here defined as the configuration 
model on the graph where a self-loop with weight r is 
added to each node. 

Ronhovde and Nussinov (RN) do not include any ran- 
dom null model, in order to avoid issues with the reso- 
lution limit, and in general set ay = toy and bij = 'Jrn 
(although for specific networks, such as with negative 
weights, they allow some minor changes). Working this 
out we obtain 

Urn = -}XAij(Wij +Jrn) -jRN)S((Ti,(Tj). (4) 

ij 

Finally, the label propagation method [21 can 
be shown to be equivalent to the Potts model 
— Sy AijWijd(<Ti,<jj) [22], which corresponds to the 
weights ay = icy and fey = 0. This is the least interest- 
ing formulation, since there is only one global optimum, 
namely all nodes in a single community, which is trivial. 
However, the local minima could be of some interest. 

It is not surprising then that these four different for- 
mulations share certain characteristics for some choice 
of parameters. The RB model is equivalent to the RN 
model up to a multiplicative constant by using an Erdos- 
Renyi (ER) null model, i.e. = p and by setting 

Irn = Jrbp/(1 - Irbp)- For -y R N = the RN model 
obviously reduces to the label propagation method. Fi- 
nally, for the AFG model, when using r = we retrieve 
the modularity (i.e. the RB model with configuration 
null model and jrb = 1). 

B. Constant Potts model 

We introduce an alternative method, that uses slightly 
different weights. By defining ay = u;y — &y and 6y = 7, 
we obtain a version that is similar to both the RB and 
the RN model, but is simpler and more intuitive to work 
with. If we work this out, we obtain the rather simple 
expression 

n = -*y](AijWij - j)S((7i, Uj). (5) 

ij 

Let us call this the constant Potts model (CPM), with the 
"constant" here referring to the comparison of Ay to the 
constant term 7. It is clear that this is equivalent to the 
RN model for unweighted graphs by setting 7 = fjS^- 
and ignoring the multiplicative constant. Furthermore, 
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it is equal to the RB model when setting 7 = jrbP for 
the ER null model. By setting 7 = we retrieve the 
label propagation method. Also, it is highly similar to 
an earlier Potts model suggested by Reichardt and Born- 

holdt ng. 

If we denote the number of edges^ inside community 
c by e c = AijWij8(<Ji, c)5(<jj, c), and the number of 
nodes in community c by n c = J^i 8{o~i, c), we can rewrite 
Eq. |5| as 

U = -J2^-in 2 c . (6) 

c 

In other words, the model tries to maximize the number 
of internal edges while at the same time keeping relatively 
small communities. The parameter 7 balances these two 
imperatives. In fact, the parameter 7 acts as the inner 
and outer edge density threshold. That is, suppose there 
is a community c with e c edges and n c nodes. Then it is 
better to split it into two communities r and s whenever 

where e r ^ s is the number of links between community r 
and s. This ratio is exactly the density of links between 
community r and s. So, the link density between com- 
munities should be lower than 7, while the link density 
within communities should be higher than 7. This thus 
provides a clear interpretation of the 7 parameter. 

In general, where 7 = min^ AijWij the optimal solu- 
tion is the trivial solution of all nodes in one big commu- 
nity. On the other extreme, when 7 = max^ A^Wij, it is 
optimal to split all nodes in communities, i.e. such that 
each node forms a community by itself. In fact, communi- 
ties of one node only exist when 7 = maxjj AijWij, since 
otherwise it will always be beneficial to put the node in 
one of its neighbors' communities. Hence, for practical 
purposes min^ AijWij < 7 < maxjj A^Wij. 

III. RESOLUTION LIMIT 

Traditionally the resolution limit is investigated by an- 
alyzing the counterintuitive merging of communities |14j , 
for example cliques or some smaller communities that are 
only sparsely interconnected as displayed in Fig. [TJ The 
RB model with a configuration null model for example 
will merge two neighboring cliques in this ring network 
of cliques when [16] jrb < q/(n c (n c — 1) + 2), where q 
is the number of cliques and n c is the number of nodes 
of a clique. Since the number of cliques q is a global 
variable, it shows modularity might be "hiding" some 
smaller communities within larger communities, depend- 
ing on the size of the network. Indeed in |14j it was sug- 
gested to look at each community to consider whether 
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it had any sub communities. Some different, though re- 
lated, problems with modularity were noticed in [24j and 
more recently in [25] . 

The AFG model considers self-loops of a certain weight 
to overcome this problem [TS]. Yet the model still de- 
pends on a null model, and so it is not surprising to find 
that the merging still depends on some global parame- 
ters. The implicit inequality for merging two cliques in 
the ring network of cliques is q > n c (n c — 1) + 2 + n c r, 
which for jrb = 1 and r = matches the previous re- 
sult. Although the resolution parameter r might be used 
to investigate the community structure at various scales 
similar to the jrb resolution parameter, it does not fun- 
damentally address the issues of the resolution limit. 

The RN model, on the other hand, will only join two 
cliques when [T3] ^ RN < l/(n 2 — 1), which does not de- 
pend on the number of cliques q, and depends only on 
the local variable n c , so is argued not to suffer from any 
resolution limit. For the CPM suggested here, we arrive 
at the condition 7 < 1/n 2 ., which also does not depend 
on the number of cliques q and can hence also said to be 
resolution-limit-free. More general, CPM favors to clus- 
ter r consecutive cliques instead of r — 1 at the point 
when 7 < 2/(r(r — l)n 2 ). 

However, it remains somewhat unclear what is meant 
exactly by resolution-limit-free in the above discussion, 
and the label resolution-limit-free requires a more precise 
definition. Consider for example that we take away the 
dependence on the number of links in the configuration 
null-model, so that we take pij = kikj. Notice that this 
only corresponds to a multiplicative rescaling of jrb by 
2m. Revisiting the case above, we come to the conclusion 
that cliques are separated whenever 2^/rb > {n c (n> c — 
1) + 2)~ 2 , which unsurprisingly no longer depends on any 
global variables. By the argument employed previously, 
the method should be resolution-limit-free. 

Not all problems have disappeared however. Suppose 
we take the subgraph consisting of only two of these 
cliques. We analyze when the method would merge the 
two cliques in this subgraph, which is the case whenever 
%7rb < (n c (n c — l) + l)~ 2 . Even though neither inequal- 
ity depends on any global variables, a problem remains. 
Combining the above two inequalities, we obtain that 
whenever 

(n c (n c - 1) + 2)- 2 < 2 lRB < (n c (n c - 1) + 1)~ 2 , 

the method will separate the cliques in the larger graph, 
yet merge them in the subgraph. 

The above discussion motivates us to consider the fol- 
lowing definition of a resolution-limit-free method. The 
general idea is that when looking at any induced sub- 
graph of the original graph, the partitioning results 
should not be changed. In order to introduce this defi- 
nition, let % be any objective function (which we want 
to minimize), we then call a partition C for a graph %- 
optimal whenever "H(C) < H(C') for any other partition 
C. We can then define resolution-limit-free as follows. 
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FIG. 1. (Color online) The problem of the resolution limit with modularity is usually demonstrated on a ring network of cliques. 
The cliques are as densely connected as possible, and as sparsely connected between them, while still retaining a connected 
graph. The resolution limit is said to arise because it will merge cliques depending on the size of the network. In fact, methods 
that do not suffer from the resolution limit, i.e. resolution-limit-free methods, may merge these cliques also. However, the 
distinguishing fact between resolution-limit and resolution-limit-free methods is that the first will detect smaller subcommunities 
when applied to the subgraph (i.e. not the same subpartition), while the latter will not detect smaller subcommunities (i.e. the 
same subpartition will remain optimal on the subgraph). Of course, whether the communities should consist of only cliques 
or of multiple joined cliques will still depend on the actual resolution of the method. For CPM this resolution parameter is 
designated by 7. So, for a particular value of 7 the cliques will be separated, while for another value they will be merged. When 
analyzing the property of resolution-limit-free more in detail, we will investigate this ring of clique network more closely. The 
actual weights a%j and bij need to be the same on isomorphic graphs, which restricts the possible number of different weights. 
We denote these different possibilities with a's for the present links (thicker lines) and j3's for the missing links (thinner lines). 
Please refer to the main text for further details. 



Definition 1. Let C — {Ci, C2, . . . , C q } be a H-optimal 
partition of a graph G. Then the objective function % is 
called resolution-limit- free if for each subgraph H induced 
by T> C C , the partition T> is also H-optimal. 

Furthermore, we introduce the notion of additive ob- 
jective functions. 

Definition 2. An objective function % for a partition 
C = {Ci, . . . ,C q } is called additive whenever H(C) = 
^2iH(Ci), where W(Ci) is the objective function defined 
on the subgraph H induced by Ci. 

If we have an "H-optimal partition C for an additive 
resolution-limit-free objective function H, we can replace 
subpartitions of C by other optimal subpartitions. 

Theorem 1. Given an additive resolution-limit- free ob- 
jective function W, let C be an H-optimal partition of 
a graph G and let H C G be the induced subgraph by 
T> C C. If V is an alternative optimal partition of H 
then C = C \ "D WD' is also %-optimal. 

Proof. Define C and V as in the theorem. By additivity, 
H(C') = H(C \V) + H{V), and by optimality H{V) < 
H(V). Since also H(C) = H(C \ V) + U(V) we obtain 
H(C) < H(C), so C is also optimal. □ 

Although, this might seem to contradict the NP- 
hardness of community detection methods, this is not 
the case. It states that when there are two optimal par- 
titions, any combination of those partitions are optimal, 



so in a certain sense, they are spanning a space of opti- 
mal partitions. It does not say whether such a partition 
can be easily found. Also, there might be two optimal 
partitions that cannot be obtained by recombining them, 
because all communities partly overlap with each other. 

It is also possible to prove that a complete graph K n 
with n nodes is never split (unless into all nodes sepa- 
rately) . 

Theorem 2. Given a resolution-limit- free objective 
function H, the H-optimal partition of K n for all n is 
either only one community, namely all nodes, or n com- 
munities consisting each of one node. 

Proof. Assume on the contrary there is an optimal parti- 
tion C of K n such that 1 < |C| < n. Then for any V C C 
the subgraph H induced by I? is a complete graph. But 
by assumption, T> is then not optimal, and by rcsolution- 
limit-free, C is then not optimal. Hence, inductively, the 
theorem must hold for all n. □ 

Also notice that a resolution-limit-free method will 
never depend on the size of the network to merge cliques 
in the ring of cliques network. This can be easily 
seen from the fact that a subgraph of a large ring of 
cliques network also appears in a smaller ring. So if the 
method would merge cliques in some large graph, by the 
resolution-limit-free property, it would also need to merge 
them together in the smaller graph. Hence, the actual 
merging cannot depend on the size of the network. In 
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this sense it captures this prior concept of the resolution 
limit. 

Equipped with this definition, we can analyze the first 
principle Potts model further. For example, what con- 
ditions should be imposed on the weights and bij in 
Eq. for the method to be resolution-limit-free? Would 
a method that takes into account the local number of tri- 
angles be resolution-limit-free? Or would it be possible 
to use the shortest (weighted) path for example? 

We can prove that CPM is resolution-limit-free in this 
sense, just like the RN model and the LP model. The 
CPM model is also trivially shown to be additive by 
Eq. Q. Perhaps it is less obvious, but the RB model 
is not additive, since it cannot be defined in terms of 
independent contributions, i.e. the contribution H(Ci) 
per community depends on the whole graph G, instead 
of only on the subgraph H induced by Cj. Nor is the RB 
model resolution-limit-free according to our definition, 
regardless of the null model [16 , and hence modularity is 
not resolution-limit-frcc. Furthermore, as we have seen, 
also when using pij = kikj the model is not resolution- 
limit-free. Finally, the AFG model is not resolution-limit- 
free either. 

Since the CPM model is also related to the RB model 
using the ER null-model, it is tempting to conclude it is 
also resolution-limit-free. Indeed, this might be said to 
be the case, if we choose p independently of the graph, 
i.e. not define it as p — m/n(n — 1), and simply choose 
it as some value p € R. However, we then obviously 
retrieve the CPM model. This shows that resolution- 
limit-free methods are strongly constrained, and there is 
only a fine line between resolution-limit and resolution- 
limit-free methods. 

This follows from the more general theorem we will 
now prove. For this, we first introduce the notion of 
local weights. Again, building on the idea of subgraphs, 
we define local weights as weights that do not change 
when looking to subgraphs. 

Definition 3. Let G be a graph, and let ay and bij as in 
Eq. ([I]) be the associated weights. Let H be a subgraph of 
G with associated weights a[j and b[j . Then the weights 
are called local if ay = Aa^ and bij = Xb'^ , where A = 
A (if) > can depend on the subgraph H . 

Clearly then, the RN and CPM model have local 
weights, while the RB and AFG model do not. This 
definition says that local weights should be independent 
of the graph G in a certain sense. In fact, it is quite 
a strong requirement, as it should even hold for a sin- 
gle link (ij) in the subgraph where only i and j are in- 
cluded. That means it can not depend on any other link 
but the very link itself. Since for missing links, there is 
(usually) no associated weight or anything, it can only 
be constant. There are some exceptions, such as multi- 
partite networks, or networks embedded in geographical 
space [261 EZ], where some sensible non-constant local 
weights can be provided. Hence, the RN model and the 
CPM model are one of the few sensible options available 



for having local variables. We can now prove the more 
general statement that methods using local weights are 
resolution- limit- free. 

Theorem 3. The objective function H as defined in 
Eq. ([I]) is resolution-limit-free if it has local weights. 

Proof. Let C be the optimal partition for G with com- 
munity assignments c,, T> C C a subset of this partition, 
and H the subgraph induced by T> with h nodes. Fur- 
thermore, we denote by di the community indices of T>, 
such that di = Ci for 1 < i < h and by A' the adjacency 
matrix of H, so that Aij — A\j for 1 < i < h. Assume T> 
is not optimal for H, and that T>* is optimal, such that 
H(V) > "H (£>*). Then define c* by setting c* = d* for 
1 < i < h and c* — Ci for h < i < n. Then because 
the result is unchanged for the nodes h < i < n, we have 
that 

AH = H(C) - H(C*) = ~(H(V) - H(V*)) > 
A 

where the last step follows from the locality of the weights 
and bij . This inequality contradicts the optimality of 
C. Hence, for all induced subgraphs H, the partition D is 
optimal, and the objective function % is resolution-limit- 
free. □ 

The converse is unfortunately not true. Consider a 
graph G with some weights and bij . Then pick a sub- 
graph H induced by some subpartition T>, and define the 
weights a\j = aij and b[j = bij except for one particular 
edge (kl), for which we set a' kl — ajy -I- e. Then for some 
e > 0, the original subpartition will remain optimal in H, 
while the weights are not local. Since the small change of 
the weight is only made when considering the graph H, 
all other subpartitions will always remain optimal. Of 
course, such a definition of the weight is rather odd, so 
in practice we will never use it. 

Even though the converse is not true, we can say a 
bit more. The weights can be a bit different indeed, but 
there is not that much room for these differences. We 
demonstrate this on the ring network of cliques. The 
weights can depend only on the graph, so if G and G' are 
two isomorphic graphs, then ay(G) = ai'j>(G'), where i 
and i! are two isomorphic nodes. Hence, only a number 
of weights can be different from each other in the ring 
network, as illustrated in Fig.[T] All nodes within a clique 
are isomorphic, except the node that connects to other 
cliques. So, all the edges among those n c — 1 nodes are 
similar, and will have the same weight a±. All edges 
from these n c — 1 nodes to the "outside" node will have 
the same weight a^. Finally, the edge connecting two 
cliques is denoted by a^. The missing self-loop for the 
special outside node is denote by /32 while the missing 
self-loop for the other nodes in the cliques is denoted 
by Finally, there is (1) a missing link between the 
outside node and a normal node denoted by ^3; and (2) 
a missing link between two normal nodes, denoted by /3 4 . 
These weights are illustrated in Fig. [I] 
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Let us now analyze when the method will not be 
resolution-limit-free. Then, the cliques must be merged 
in some (large) graph, while for the subgraph consisting 
of these two merged cliques, they should be separated by 
the method. Or conversely, they should be separated in 
some (large) graph, but merged in the subgraph. We can 
write the 1-L S for all q cliques being separate as 

U s = -q{a x {n c - l)(n c - 2) + 2a 2 (n c - 1) 

- (n c - 1)0! - p 2 ) 

and Um for merging all two consecutive cliques as 

H m = -|2(ai(n c - l)(n c - 2) + 2a 2 (n c - 1) 

- (n c - l)/3i -p 2 + a 3 - fh(n c - 1) - /3 4 (n e " I) 2 ) 

Furthermore, for the induced subgraph H consisting of 
two consecutive cliques, we can write U' s for separating 
the two cliques and W m for merging them, similarly as 
before, where a' and j3' are the weights for the subgraph 
H. Then the method is not resolution-limit-free if it 
would merge the two cliques at a higher level (i.e. when 
H m < Hs) yet would not merge them at smaller scale 
(i.e. when H' s < H' m ), or vice versa. Working out this 
condition for U. m < H. s (and similarly for H m > H s ) 
gives us 

a 3 > (n c - l)(ft(n c -l) + fo), 

while for W s < H' m (and similarly for H' s > H' m ) we 
obtain 

a' 3 <(n c -l)(/3' i (n c -l)+l3' 3 ). 

Combining these two inequalities for both cases we obtain 

a' 3 (Mn c - 1) + /3 3 ) < o 3 C^(nc - 1) + fls), (7) 
a'Mn c - 1) + /3 3 ) > a 3 (#(n c - 1) + (8) 

where either Eq. Q or {[sj) should hold. Hence, only if 
the left hand side equals the right hand side, it does not 
constitute a counter example. Working out this equality, 
there are two possibilities. Either the weights should be 
local, or the following equality should hold 

Obviously, this again constitutes some very particular 
case of non-local weights. We can repeat this same pro- 
cedure for other subpartitions, and for other graphs, 
thereby forcing the weights to be of a very particular 
kind. This thus leaves little room for having any sensible 
non-local definition such that the method is resolution- 
limit-free. 

This means resolution-limit-free community detection 
has only a quite limited scope. In fact, the CPM seems 
to be the simplest non-trivial sensible formulation of any 
general resolution-limit-free method, although there is 



some leeway for special graphs (i.e. having some node 
properties, such as multipartite graphs). This is not to 
say that methods with non-local weights (e.g. modular- 
ity, AFG, number of triangles, shortest path, between- 
ness) should never be used for community detection at 
all, they are just never resolution-limit-free. 



IV. PERFORMANCE 

In order to assess the performance of the proposed 
CPM model, we performed various tests. Using the latest 
suggested test networks 13j we find that the CPM model 
and the accompanying algorithm is both very accurate 
and efficient. More details on the efficient Louvain-like 
algorithm, the test procedure and the calculations on the 
resolution parameters can be found in the appendix at 
the end of this article. 

We have examined both directed test networks as well 
as hierarchical test networks, where communities exist 
at multiple levels in the data. Communities become less 
discernible for higher values of the parameter fi of having 
links outside its community. For hierarchical communi- 
ties, there are two such parameters: [i\ for the first level 
(the large communities), and /i 2 for the second level (the 
subcommunities) . These mixing parameters allow us to 
calculate what the inner and outer densities of commu- 
nities are. We exploit this fact to calculate the proper 
7 = 7* in order to investigate the performance of the 
CPM, and similarly jrb — 1*rb f° r * ne model using 
the configuration null model. This way, the results do 
not depend on any particular method to determine the 
correct parameter, which represents another challenging 
problem. 

Some of the earlier algorithms and models that showed 
excellent performance |28j are the Louvain [7] method for 
optimizing modularity, and the Infomap method [23]. In 
Fig. [2] we have displayed the results for (1) the CPM 
model; (2) the RB model using an ER null model (i.e. 
CPM with 7 = p) ; (3) the RB model using the con- 
figuration null moderlwith "corrected" parameter value 
Irb'' (4) the modularity model, (i.e. RB using the con- 
figuration null model and jrb = 1); and finally (5) the 
Infomap method. We have performed tests on networks 
having n = 10 3 and n = 10 4 nodes, with a degree distri- 
bution exponent of 2 (with average degree 15 and max- 
imum degree 50) and community size distribution expo- 
nent 1 (with community sizes ranging from 20 to 100). 
Per value of /i 100 graphs were used to obtain this result. 

It can be clearly seen that CPM performs extremely 
well. The difference in performance of the CPM model 
in comparison to the RB model using the ER null model 
is especially striking. This is not a consequence of the 



2 Since we use directed test networks, we use a small adjustment 
to use the directed configuration null model 1301 
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FIG. 2. (Color online) Performance of various community de- 
tection methods in terms of Normalized Mutual Information 
(NMI) depending on mixing parameter fi. Results are shown 
for (1) CPM, using the calculated resolution parameter 7*; 
(2) the RB model with an ER random null model (i.e. cor- 
responding to CPM using 7 = p); (3) the RB model with a 
configuration null model with a calculated resolution param- 
eter 7jj S ; (4) modularity, in other words, the RB model with 
a configuration model using ^rb = 1; and finally (5) the In- 
fomap method. The open symbols denote results for n — 10 3 
and the closed symbols for n = 10 4 . 

method being resolution-limit-free or not, but it rather 
depends on choosing the correct resolution parameter. 
Obviously then, setting 7 = p is in general not a very 
good strategy, and for general networks one should care- 
fully analyze at which resolution the network contains 
meaningful partitions. 

A similar effect also shows for modularity (or the RB 
model using the configuration model), such that when 
jub is chosen appropriately (i.e. using 7^5 = j RB ) the 
method will perform better than at the ordinary resolu- 
tion jrb — 1- Indeed, the results of the CPM model and 
the RB model using the configuration null model using 
1*rb are rather comparable, although the latter's perfor- 
mance drops less quickly, and then outperforms CPM. In- 
terestingly, when we use the ordinary resolution jrb = 1, 
it becomes more difficult to detect communities in large 
networks using the configuration model. This constrasts 
with the results when we choose the appropriate resolu- 
tion parameter 7* , 7^ and indeed also for the Infomap 
method. Indeed it can be shown that the communities 
should become more clearly discernible for larger net- 
works when the community sizes remain similar. 

Surprisingly, both methods outperform the Infomap 
method, which performed superbly in previous tests [28], 
when the appropriate resolution parameter is chosen. 
This show that determining the correct or meaningful 
resolution is an important issue. This remains a chal- 
lenging problem, and various methods have been pro- 
posed to do so [J] , for example by looking at the stability 
of multiple (randomized) runs of an algorithm [191 131] , 
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FIG. 3. (Color online) Performance of CPM and Infomap 
on a hierarchical benchmark network in terms of Normalized 
Mutual Information (NMI) depending on mixing parameters 
/ii and fi2- The networks had n = 10 4 nodes with a degree 
distribution exponent of 2 (with average degree 20 and max- 
imum degree 50) and community size distribution exponent 
1 for both small (size ranging from 10 to 50) and large com- 
munities (size ranging from 50 to 300). Per combination of 
parameters 10 graphs were used to obtain this result. The 
resolution parameters 7 for the two different levels were cal- 
culated analytically for CPM. 



by looking for large ranges of the parameter over which 
the results remain stable [IS], investigating the stability 
when the network is slightly perturbed |32j or by looking 
at how significant the partition is compared to a graph 
ensemble [33] . 

We have also performed extensive tests on hierarchi- 
cal networks, where the method also performs well, and 
is able to extract the two different levels of communi- 
ties effectively, as displayed in Fig. [3] For relatively low 
H2 0.7, the first (larger) level becomes more clear for 
low fii, while the second (smaller) level becomes more 
clear for larger pb\ . This is both the case for a recent hier- 
archical version of the Infomap method [34] and the CPM 
method. The Infomap method seems to be slightly bet- 
ter at detecting the correct communities, but the CPM 
method remains highly competitive. The possibility for 
having various scales of description of the network seems 
important, as many networks seem to have at least some 
hierarchical structure. 



V. CONCLUSION 

Several community detection methods, among which 
modularity, are affected by the problem of the resolution 
limit. In this paper we have provided a novel rigorous 
definition of what it means for a community detection 
method to be resolution (limit) free. Most importantly, 
we are able to prove exactly which community detection 
methods are resolution-limit-free, namely those methods 
that use local weights. This also clarifies the relationship 
between local methods and the resolution limit. How- 
ever, we do not address the issue of determining an ac- 
tual meaningful resolution, which remains a challening 
problem. 

Moreover, there does not seem to be much room 
for having resolution-limit-free methods without local 
weights. Of the few possibilities available for having 
resolution-limit-free community detection, the constant 
Potts Model (CPM) we introduced in this paper seems 
to be the simplest possible formulation, and performs 
excellent. A rigorous definition of resolution-limit-frcc 
community detection allows for a more articulate analy- 
sis, and induces further progress on developing novel and 
meaningful methods. 
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Appendix A: Louvain like Algorithm 

The algorithm we employ is derived from the Louvain 
method [7] . We use the concept of node size, denoted by 
rii for a node i, initialized to n, = 1 (indeed the commu- 
nity size n c = Yli n i^(' 7 ii c ) ^ s related). We first iterate 
(randomly) over all nodes, and put nodes greedily into 
the community that minimizes Eq. Q . We subsequently 
create a new graph based on the communities, and new 
node sizes, and reiterate over this new smaller graph. 
More specifically: 

1. Initialize A^j = Wij, with w\j — 1 in the case of 
unweighted networks, and set rii = 1 for all nodes 
i. 

2. Loop over nodes i, remove it from its community 
and calculate for each community c the increase if 



we would put node i into community c, 
AH(oi = c) = -(e^ - 2-yrn ^ njSfa, c)), (Al) 
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where e^ c = Y^ji^j + Aji)8{<jj, c ) is the number 
of edges between node i and community c. We put 
node i into the community c for which AH^at — 
c) is minimal. We iterate until we can no longer 
decrease the objective function. 

3. We build a new graph A' cd = J2ij Aij5(<Ji, c)5(aj,d) 
and node sizes n' c = J2 i n.iS(cri, c). We repeat step[2] 
by setting A = A' and n = n' until the objective 
function can no longer be decreased. 

The implementation of the algorithm in C++ can be 
downloaded from the author's website: http : //perso . 
uclouvain.be/vincent .traag 

Notice that for resolution-limit-free methods, the re- 
sults should be unchanged on subgraphs. Hence, we 
could therefore perform the method recursively on sub- 
graphs. We suggest then the following improvement. 
First we cut the network at each recursive call, until the 
density of the subgraph exceeds 7. Then, we recombine 
the subgraphs, and loop over nodes/communities to find 
improvements until we can no longer increase greedily, 
and return to the previous recursive function call. These 
calls should be easily parallelized, making community de- 
tection in even larger graphs or in an on-line setting pos- 
sible by using cluster computing. 

Appendix B: Benchmark tests 

The benchmark networks are created using a known 
community structure, i.e. a planted community struc- 
ture. The community sizes n c are chosen from a distri- 
bution following a power-law Pr(n c = n) ~ n~ T ' 2 . The 
degrees ki of the nodes are also chosen from a power-law 
distribution Pr(fcj = k) ~ fc~ Tl . The stubs are then con- 
nected, with probability 1 — [i within a community, and 
with probability [i between two communities. A lower 
bound n c and upper bound TT C on the community sizes 
is imposed, while for the degree the average degree (k) 
is specified. For the hierarchical version, there are two 
levels, with the communities of the second level embed- 
ded in the first level. A fraction of [i\ of the links is 
placed between two different macro communities at the 
first level, while a fraction of /i2 of the links are placed 
between the small communities of the second level (but 
within the same large community). 

Instead of detecting the resolution algorithmically, we 
calculate the proper resolution parameter value 7 ana- 
lytically (and therefore, beforehand). In order to do so, 
we consider the following. The resolution parameter 7 
acts as a sort of threshold on inner and outer community 
density. If we were to set 7 equal to the inner density, it 
would be rather difficult to fulfill the condition that the 
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inner density should be higher than that, and similarly 
so for 7 equal to the outer density. So, we need to be as 
far as possible from both the inner density as well as the 
outer density, which would be simply the average of the 
two. 

The inner density for a community having n c nodes 
can be easily found as 



(i-c)W 

n r — 1 



(Bl) 



and the outer density (i.e. all the edges originating from 
a community to the outside) is 



Pout 



M (fc) 



(B2) 



where n is simply the total number of nodes. The average 
community size (n c ), which is proportional to 



(B3) 



Hence, each community should have a "expected den- 
sity" or "degree density" pi n = e c /[e c ] within communi- 
ties lower than 7, while the outer degree density should 
be lower between communities. Writing this out in terms 
of the configuration model, given the model of the test 
networks, we arrive at 



Pin 
Pout 



(1 - n)n 



n c 



(B5) 
(B6) 



These degree densities lack a clear interpretation, in 
contrast with CPM. Similar as before we simply set 
1*rb = \{{Pvn) + (Pout)) for the average community size 
<n c >. 

For comparing our results to the known community 
structure, we use the normalized mutual information. 
Given two different partitions C and D, the mutual in- 
formation I is defined as 



I(C,D) 



n 



logn- 



where n c is the minimal community size and W c the max- 
imal community size, than gives us the (pi n ) and (p ut) 
for the average community size. The best resolution pa- 
rameter is then 7* = \{(p in ) + (Pout))- 

For the hierarchical test networks, we can perform a 
similar analysis, and use the average of the inner and 
outer density, similar as before, for the two different lev- 
els. Ordinarily, the communities are assumed to exist 
whenever p m > p out . 

For modularity, we can also calculate similar bounds. 
When we define by e c the number of edges within com- 
munity c and by [e c ] the number of expected edges within 
a community, modularity can be written as 



u 



^2 e c~ 7fls[ e c]- 



(B4) 



with n r , s being the number of nodes that are in commu- 
nity r in partition C and in community s in partition D, 
while n r simply denotes the number of nodes in commu- 
nity r. The normalized mutual information I n (C,D) is 
then defined as 



In(C,D) = 



2I(C,D) 
H(C) + H(D)' 



where H(C) indicates the entropy of a partition C, which 
is defined as 



H(C) 



^ — ' n 



nc 
n 



The normalized mutual information < I n (C,D) < 1, 
with 1 indicating equivalent partitions. 
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